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Abstract 

We define in the space of n x m matrices of rank n, n < m, 
the condition Riemannian structure as follows: For a given matrix A 
the tangent space at A is equipped with the Hermitian inner prod- 
uct obtained by multiplying the usual Frobenius inner product by 
the inverse of the square of the smallest singular value of A denoted 
a n (A). When this smallest singular value has multiplicity 1, the func- 
tion A — ► log(a n (A)~ 2 ) is a convex function with respect to the con- 
dition Riemannian structure that is i — > log(a n (A(t))~ 2 ) is convex, in 
the usual sense for any geodesic A(t). In a more abstract setting, a 
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function a defined on a Riemannian manifold (A4, (,)) is said to be 
self-convex when loga(j(t)) is convex for any geodesic in (M.,a (,)). 
Necessary and sufficient conditions for self-convexity are given when 
a is C 2 . When a(x) = d(x,J\f)~ 2 where d(x,Af) is the distance from 
x to a C 2 submanifold J\f C W we prove that a is self-convex when 
restricted to the largest open set of points x where there is a unique 
closest point in N to x. We also show, using this more general notion, 
that the square of the condition number ||^4|| F /a n (A) is self-convex 
in projective space and the solution variety. 



1 Introduction 

Let two integers 1 < n < m be given and let us consider the space of matrices 
K™ xm , j£ _ j£ or q equipped with the Frobenius Hermitian product 

(M,N) F = trace (N*M) = ^ m ir n~ r 

Given an absolutly continuous path A(t), a < t < b, its length is given by 
the integral 



r 


dA{t) 


1 a 


dt 



and the shortest path connecting A (a) to A(b) is the segment connecting 
them. Consider now the problem of connecting these two matrices with the 
shortest possible path in staying, as much as possible, away from the set of 
"singular matrices" that is the matrices with non-maximal rank. 

The singular values of a matrix A G K" xm are denoted in non-increasing 
order: 

<Ti(A) > ... > <r n -i{A) > a n (A) > 0. 

We denote by GL njjn the space of matrices A e K nxm with maximal rank : 
rank A = n, that is <J n (A) > so that the set of singular matrices is 

J\f = K nxm \ GL n>m = {A e K nxm : cr n (A) = 0} . 

Since the smallest singular value of a matrix is equal to the distance from 
the set of singular matrices: 

&n(A) = d F (A,Af) = min \\A — S\\ F , 
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given an absolutly continuous path A(t), a < t < b, we define its "condition 
length" by the integral 

<7 n (A(£)) -1 cft. 

F 

A good compromise between length and distance to M is obtained in mini- 
mizing L K . We call "minimizing condition geodesic" an absolutly continuous 
path, parametrized by arc length, which minimizes L K in the set of absolutly 
continuous paths with given end-points and condition distance d K (A,B) be- 
tween two matrices the length L K of a minimizing condition geodesic with 
endpoints A and B, if any. 

In this paper our objective is to investigate the properties of the smallest 
singular value a n (A(t)) along a condition geodesic. Our main result says 
that the map log (a n (A(t))^ 1 ) is convex. Thus a n (A(t)) is concave, and its 
minimum value along the path is reached at one of the endpoints. 

Note that a similar property holds in the case of hyperbolic geometry 
where instead of K nxm we take W 1 ^ 1 x [0, oo [, instead of J\f we have lR n_1 x {0}, 
and where the length of a path a{t) = (ai(t), . . . ,a n (t)) is defined by the 
integral 



/ 


da(t) 




dt 



Geodesies in that case are arcs of circles centered at R" _1 x {0} or segments 
of vertical lines, and log (a ri (t)~ 1 ) is convex along such paths. 

The approach used here to prove our theorems is heavily based on Rie- 
mannian geometry. We define on GL n m the following Riemannian structure: 

(M,N) KA = a n (Ay 2 Re(M,N) F 

where M, N e K nxm and A e GL„ im . The minimizing condition geodesies 
defined previously are clearly geodesic in GL„ jm for this Riemannian struc- 
ture so that we may use the toolbox of Riemannian geometry. In fact things 
are not so simple: the smallest singular value cr n (A) is a locally Lipschitz 
map in GL„ im , and it is smooth on the open subset 

GL> m = {Ae Gh n ,m ■■ <Jn-i(A) > a n (A)} 

that is when the smallest singular value of A is simple. On the open subset 
GL> the metric (•, -) K defines a smooth Riemannian structure, and we call 
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"condition geodesies" the geodesies related to this structure. Such a path is 
not necessarily a minimizing geodesic. Our first main theorem establishes a 
remarkable property of the condition Riemannian structure: 

Theorem 1. a~ 2 is logarithmically convex on GL> m i.e. for any geodesic 
curve j(t) in GL> m for the condition metric the map log (<r~ 2 (7(t))) is con- 
vex. 

Problem 1. The condition Riemannian structure (., .) is defined in GL„ im 
where it is is only locally Lipschitz. Let us define condition geodesies in GL„ im 
as the extremals of the condition length L K (see for example |5j/ Chapter 4, 
Theorem 4-4-3, for the definition of such extremals in the Lipschitz case). Is 
Theorem U\ still true for GL„ )m ? All the examples we have studied confirm 
that convexity holds, even if a~ l {^{t)) fails to be C 1 . See Boito-Dedieu f^. 
We intend to address this issue in a future paper. 

In a second step we extend these results to other spaces of matrices: the 
sphere § r (GL> m ) of radius r in GL> m in Corollary [HI the projective space 
P (GL> m ) in Corollary [3 We also consider the case of the solution variety 
of the homogeneous equation M( = that is the set of pairs 

{(M,C) G K nx(n+1) x K n+1 : M( = 0} . 

Now our function a is the square of the condition number studied by Demmel 
in [1]. This is done in the affine context in Theorem [3] and in the projective 
context in Corollary [HI 

Since a n (A) is equal to the distance from A to the set of singular matrices 
a natural question is to ask whether our main result remains valid for the 
inverse of the distance from certain sets or for more general functions. 

Definition 1. Let (A4, (•, •)) be Riemannian and let a : M. —>■ K. be a function 
of class C 2 with positive values. Let M. R be the manifold M. with the new 
metric 

called condition Riemann structure. We say that a is self-convex when 
loga(7(i)) is convex for any geodesic 7 in M. K . 

For example, with M. = {x = (xi, . . . , x n ) el™ : x n > 0} equipped with 
the usual metric, a{x) = x~ 2 is self-convex. The space M. K is the Poincare 
model of hyperbolic space. 
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In the following theorem we prove self-convexity for the distance function 
to a C 2 submanifold without boundary Af C W . Let us denote by 

p(x) = d(x,Af) = min \\x — y\\ and a(x) = —p-^- 
yeAf P\ x ) 

Let IA be the largest open set in W such that, for any x ElA, there is a unique 
closest point in Af to x. When IA is equipped with the new metric a(x) (., .) 
we have: 

Theorem 2. The function a : U \ Af —>■ R is self-convex. 

Theorem [2] is then extended to the projective case. Let Af be a C 2 sub- 
manifold without boundary of P(R J ). Let us denote by oIr the Riemannian 
distance in projective space (points in the projective space are lines throught 
the origin and the distance oIr between two lines is the angle they make). 
Let us denote dp = smdn (this is also a distance), define a(x) = dp(x,Af)~ 2 , 
and let IA be the largest open subset of P(R J ) such that for x E IA there is a 
unique closest point from Af to x for the distance dp. Then 

Corollary 1. The map a : IA \Af —>■ R is self-convex. 

The extension of Theorem [1] and Theorem [2] to other types of sets or 
functions is not obvious. In Example [T] we prove that a (A) = ai(A)~ 2 + 
• • • + a n (A)~ 2 is not self-convex in GL n)m . 

In Example [2] we take Af = R 2 , and IA the unit disk so that IA contains a 
point (the center) which has many closest points from Af. In that case the 
corresponding function a : U\Af — > R is self-convex but it fails to be smooth 
at the center of the disk. 

In Example [3] we provide an example of a submanifold Afcl 2 such that 
the function a(x) = d(x,Af)~ 2 defined on R 2 \ Af is not self-convex. 

Our interest in considering the condition metric in the space of matrices 
comes from recent papers by Shub [8] and Beltran-Shub [lj where these 
authors use condition length along a path in certain solution varieties to 
estimate step size for continuation methods to follow these paths. They give 
bounds on the number of steps required in terms of the condition length of the 
path. If geodesies in the condition metric are followed the known bounds on 
polynomial system solving are vastly improved. To understand the properties 
of these geodesies we have begun in this paper with linear systems where we 
can investigate their properties more deeply. We find self-convexity in the 
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context of this paper remarkable. We do not know if similar issues may 
naturally arise in linear algebra even for solving systems of linear equations. 
Similar issues do clearly arise when studying continuation methods for the 
eigenvalue problem. 

2 Self-convexity 

Let us first start to recall some basic definitions about convexity on Rieman- 
nian manifolds. A good reference on this subject is Udri§te 

Definition 2. We say that a function f : M. — > R is convex whenever 

f(j xy {t))<(l-t)f{x)+tf{y) 
for every x, y G M., for every geodesic arc j xy joigning x and y and < t < 1. 

The convexity of / in Ai is equivalent to the convexity in the usual sense 
of / ° ixy on [0, 1] for every x,y G U and the geodesic j xy joining x and y 
or also to the convexity of g o 7 for every geodesic 7 ([9] Chap. 3, Th. 2.2). 
Thus, we see that 

Lemma 1. Self- convexity of a function a : At — > R is equivalent to the 
convexity of log o« in the condition Riemannian manifold Ai K . 

When / is a function of class C 2 in the Riemannian manifold Ai, we 
define its second derivative D 2 f(x) as the second covariant derivative. It is 
a symmetric bilinear form on T x Ai. Note ([£ Chapter 1]) that if x G M. and 
x G T x Ai, and if 7(i) is a geodesic in Ai, 7(0) = x, ^7(0) = x, then 

D 2 f(x)(x,x) = ^(fo 1 )(0). 

This second derivative depends on the Riemannian connection on M.. Since 
M. is equipped with two different metrics: (.,.) and (-,-) K we have to dis- 
tinguish between the corresponding second derivatives; they are denoted by 
D 2 f(x) and D 2 f(x) respectively. No such distinction is necessary for the 
first derivative Df(x). 

Convexity on Riemannian manifold is characterized by (see [9] Chap. 3, 
Th. 6.2): 
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Proposition 1. A function f : Ai — > R 0/ c/ass C 2 zs convex if and only if 
D 2 f(x) is positive semide finite for every x G M.. 

We use this proposition to obtain a caracterisation of self-convexity: a 
is self-convex if and only if the second derivative _D 2 (log oa)(x) is positive 
semi definite for any x G M K . We get 

Proposition 2. For a function a : M. — > R 0/ c/ass C 2 wjffi positive values 
self-convexity is equivalent to 

2a{x)D 2 a{x){x,x) + \\Da(x) || 2 ||i|| 2 - 4{Da(x)x) 2 > 

/or any x e M. and for any vector x G T X A4, the tangent space at x. 

Proof. Let x G M be given. Let : R m — > M be a coordinate system such 
that tp(0) = x and with first fundamental form 5^(0) = 8^ (Kronecker's 
delta) and Christoffel's symbols r* fc (0) = 0, and let 

A = ao ip 



so that a(x) = A(0). Those coordinates are called "normal" or "geodesic". 
Note that this implies 

f)n- 

-(0) = 



dgi 3 



dz k 

for all k. We denote by g K ^ and 1^, - fc respectively the first fundamental 
form and the Christoffel symbols for ip in M K . Let us compute them. Note 
that 

g K ,ij(z) =gij(z)A(z), 



dg K ,ij 



(0) = Dg K!ij (0)(e k ) = D( 9ij A)(0)(e k ) 



Moreover, 



dA 

9ij (0)DA(0)(e k ) + A(0)D 9ij (0)(e k ) = $^ — {0). 

oz k 



1 r i 1 / 9g K ,ij , s dg K)ik , . dg K j k 



K,,, - m ^ - ^ ^(0) + -f-(o) - -^(0) 



1 



2A(0) 



( 8A r)A r)A \ 
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That is, 

^rejj = 2A(0) SijW' ^ 

■^Kjfc = otherwise. 
The second derivative of the composition of two maps 

M -A R ^ R 
is given by the identity (see [Sj Chap. 1.3, Hessian) 

D\ij o f)(x) = Dij(f(x))D 2 f(x) + i>"(f(x))Df(x) ® D/(a;) 
and where Df(x) (g> Df(x) is the bilinear form on T^TW by 

0D/(z) ® Df(x))(u,v) = Df{x){u)Df(x)(v). 
This gives in our context, that is when f = a and ■?/> = log, 

D 2 (logoa)(x) = —-—D 2 a(x) --^-rDafx) ® Da(x). 

a(x) a(xj 2 

According to Proposition [1] our objective is now to give a necessary and 
sufficient condition for _D^(log oa)(x) to be positive semidefinite for each 
x G M. In our system of local coordinates the components of D 2 a(x) are 
(see Chap. 1.3) 

A d*A y ri dA _ d 2 A 

while the components of D 2 a(x) are 

K ' 3k ~ d Zj dz k ^ K ' jk d Zi - 

If we replace the Christoffel symbols in this last sum by the values previously 
computed we obtain, when j = k, 

Vr ^ = r* M + rr M_ 
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2A\dz~) ~2Aj^\dz~) 'A\dz~) ~2A^Vdi~y 



while when j ^ k, 



KjkQ KjkQ T K , jk g 



1 dA dA 1 dA OA 1 <9A «9A 
+ 



2A dzk dzj 2 A dzj dzu A dzj dzk 
Both cases are subsumed in the identity 

Kjk dzi~ Adzjdz k 2A^\dz i ) ' 

Putting together all these identities gives the following expression for the 
components of D^(\ogoa)(x): 

D k a (logoa)(,), fe = - ^_ - - — — + - ^ (^-j j 

d 2 A „ ^/<9A\ 2 



2 A 2 I dzjdzk *—^\dzi) dzj dz k 
Thus, -D 2 (log oa)(i) > if and only if 

2a(x)D 2 a(x) + \\Da(x)\\ 2 x (., .) x - ADa{x) ® Da(x) 
is positive semi-definite, that is when 

2a{x)D 2 a{x){x,x) + \\Da{x)\\ 2 x \\x\\ 2 x - A(Da{x)x) 2 > 
for any x G M. and for any vector x G T x Ai. This finishes the proof. □ 



An easy consequence of Proposition [2] is the following. See also Example 

13 

Corollary 2. When a function a : M. —>■ R o/ c/ass C 2 zs self-convex then 
any critical point of a has a positive semi- definite second derivative D 2 a(x). 
Such a function cannot have a strict local maximum or a non- degenerate 
saddle. 



9 



Proposition 3. The following condition is equivalent for a C 2 function a = 
1/p 2 : M. — > R to be self-convex on Ai: For every x G M. and x G T X A4, 

||x|| 2 ||Dp(x)|| 2 - (Dp(x)x) 2 - p(x)D 2 p(x)(x,x) > 0, 

or, what is the same, 

2\\x\\ 2 \\Dp(x)\\ 2 > D 2 p 2 (x)(x,x). 

Proof. Note that 

Da(x)x = Dp(x)x, 
p(x) 6 

6 2 

D 2 a(x)(x, x) = (Dp(x)x) 2 i—^tD 2 p(x)(x, x). 

p(xp P\ x ) 

Hence, the necessary and sufficient condition of Proposition [2] reads 

'2 ; " --,{Dp{x)x) 2 +—-,{Dp{x)x) 2 r ^D 2 p(x)(x,x) > 0, 

p(xf p(x) &y K ' ' p(x) eK rK ' ' p(xf y Jy ' 

and the proposition follows. □ 

Corollary 3. Each of the following conditions is sufficient for a function 
a = 1/p 2 : M. — > R to be self-convex at x G M.: For every x G T X A4, 

D 2 p(x){x,x) < 0, 

or 

\\D 2 p 2 (x)\\<2\\Dp(x)\\ 2 . 

In the following proposition we obtain a weaker condition on a to obtain 
convexity in Ai K instead of self-convexity. 

Proposition 4. a(x) is convex in M. K if and only if 

2a(x)D 2 a(x)(x, x) + ||Z)a(x)|| 2 ||x || 2 - 2(Da{x)x) 2 > 0, 
for any x G M. and any vector x G T X A4 . 

Proof. We follow the lines of the proof of Proposition [2] with ip equal to the 
identity map instead of ip = log. □ 
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3 Some general formulas for matrices 



Proposition 5. Let A = (£, 0) G GL> m , where E = diag (o"i > ■ ■ • > 
> a n ) G K nx ™. T/ie map a n : GL> m — > R is a smooth map and, for 
every U G K nx "\ 



nn ) i 



Da n (A)U = Re(u, 

D 2 a 2 n (A)(U, U) = 2 E™ =1 Kf - 2 Y2~=\ 

Proof. Since is an eigenvalue of A4* with multiplicity 1, the implicit 
function theorem proves the existence of smooth functions o-^(B) G R and 
u(B) G K ra , defined in an open neighborhood of A and satisfying 

BB*u{B) = a 2 n (B)u(B), 
IK5)|| 2 = 1, 

u(A) = e n = (0,...,0,l) T eP, 
°M = °l- 

Differentiating these equations at B gives, for any U G K raxm , 

(UB* + BU*)u{B) + BB*u(B) = {<J 2 n )' u{B) + a 2 n {B)u{B), 
u(B)*u(B) = 

with = Du(B)U and = Da 2 n (B)U . Pre-multiplying the first 

equation by u(B)* gives 

u(B)*(UB*+BU*)u(B)+u(B)*BB*u(B) = (a 2 n )' u(Byu(B)+a 2 l (B)u(B)*u(B) 
so that 

Da 2 n (B)U = (a 2 n )' = 2Re(u(ByUB*u(B)) 

and 

^" (B)f/ " o-jB) • 

The derivative of the eigenvector is now easy to compute: 

Du{B)U = u(B) = (a 2 n (B)I n - BB*)\UB* + BU* - (a 2 )' I n )u{B) 

where (a 2 (B)I n — BB*Y denotes the generalized inverse (or Moore-Penrose 
inverse) of a 2 n {B)I n - BB*. 
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The second derivative of a 2 at B is given by 

D 2 al(B)(U,U) = 2Re(u(B)*UB*u(B)+u(B)*UU*u(B)+u(B)*UB*u(B)) 

2Re(u(B)*UU*u(B) + u(B)*(UB* + BU*)u(B)) = 2Re(u(B)*UU*u(B)^ 

u(B)*(UB* + BU*)(a 2 (B)I n - BB*)*(UB* + BU* - (<%)' I n )u{B)). 

Using u(A) = e n and a n {A) = a n we get 

Da 2 n {A)U = 2Re{UA*) nn = 2a n Re{u nn ), 
Da n (A)U = Re(u nn ), 

and the second derivative is given by 

D 2 a 2 n (A)(U,U) = 

(n-l 
(UU*) nn + J2(UA* + AU*) nk (a 2 n - 4)~\UA* + AU* - (a 2 n )' I n ) kn 
k=l 

2Re ( (UUlnn + £ \( UA * + AU p*A = 2 J2 \ Unj f-2 V \™« + *f*\ 

□ 

Corollary 4. Let A = (E, 0) G GL> m , where S = diag (a 1 > ■ ■ ■ > a n _i > 
o n > 0) E K nxn . Let us define p(A) = a n (A)/ \\A\\ F . Then, for any U G 
K nxm such that Re (A, U) F = 0, we have 

Dp(A)U = «g->, _ 

DV(Am u) = ^ f (e- , m 2 - e^i 1 K -y-y-' 2 - gj**) ■ 

Proof. Note that 

p( } ~ PS ~ PC ' 

and the first assertion of the corollary follows from Proposition [5j For the 
second one, note that h = h\jh 2 (for real valued C 2 functions h, hi, h 2 with 
h 2 (0) ^ 0) implies 



D 2 h 



h\D 2 h x - h 1 h 2 D 2 h 2 - 2h 2 Dh 1 Dh 2 + 2h l (Dh 2 ) 2 

hi ' 



'2 

Now, p 2 (A) = a 2 (A)/\\A\\ 2 F , D(\\A\\ 2 F )U = 2Re(A, U) F = 0, D 2 (\\A\\ 2 F )(U, U) - 
2\\U\\ F , and D 2 o~ 2 (A)(U, U) is known from Proposition [5j The formula for 
D 2 p 2 (A) follows after some elementary calculations. □ 
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4 The affine linear case 



We consider here the Riemannian manifold Ai 
usual Frobenius Hermitian product. Let a : GL> 
a(A) = l/al(A). 



GL> m equipped with the 
-> K be defined as 



> 



Corollary 5. TTie function a is self-convex in GL^ 

Proof. From Proposition [3J it suffices to see that 

2\\U\\%\\Da n (A)\\ 2 F >D 2 a 2 n (A)(U,U). 

Since unitary transformations are isometries in GL> m with respect to the 
condition metric we may suppose, via a singular value decomposition that 
A = (£, 0) G GL> m , where £ = diag {a x > ■ ■ ■ > a n _ ± > a n ) G K nxn . Now, 
the inequality to verify is obvious from Proposition as H-Dcr^A)!!^ = 1 
and 



n-l 



D*o-l(A)(U,U) = 2j2 



u 



■a j i 



! - 2 E 



fc=i 



«ni| 2 < 2||C/|| 



2 



□ 



Corollary 6. Lei r > 0. T/ie function a is self-convex in the sphere § r (GL>, 
of radius r in GL> m . 



Proof. It is enough to prove that any geodesic in (§ r 



JL> m ), a) is also a 



geodesic in (GL>, 



Indeed, suppose that A and -B are matrices in 



§ r .(GL> m ) and the minimal geodesic in (GL> m , a) between A and B is X(t), 

rX(t) 



a <t <b. Then we claim that L K ( 
rX(t) 



IX 



dX(t) 
dt 



so that 



/ r 2 



\\x 
d 

di 



< L K (X(t)). Indeed, for any t, 
X(t)Re((X(t), d -^) F ) 

\\x(t)\\l 



rX(t) 



dX(t) 
dt 



\x 



+ 



r 2 Re((X(t),^) F y 



\X 



dt 
4 
F 



2r*Re((X(t), d -^) F y 



1/2 



IX 
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/ r 2 



dX{t) 
dt 



r 2 Re({X(t),^) F ) 



A 



1/2 



Hence, 



IX 



d ( rX(t) 



\X 



< 



dX(t) 
dt 



J 



X 



dt V X 



rX(t) 



\X 



d ( rX(t) 



dt V \\X 



\X(t)\\ F a-' (X(t)) 



d ( rX(t) 



dt \ X 



< °n (X(t)) 



dX{t) 




dX(t) 


dt 


F 


dt 



Therefore X(t) can only be a minimizing geodesic if it belongs to § r (GL> m ). 
Since all geodesies are locally minimizing geodesies, Corollary [6] follows. □ 

The following gives an example of a smooth and non- self convex function 
in GL riim . 

Example 1. For n > 3, the function a(A) = a\(A)~ 2 + • • • + cr n (A)~ 2 is not 
self-convex in GL„ m . 

Proof. For simplicity we consider the case of real square matrices. We have 
a(A) = \\A-Y F , 

Da(A)A = -2(A~ X , A~ l AA~ l ) F = -2{A~ T A' 1 A~ T , A) F , 

\\Da{A)\\ F = A\\A- T A- l A- T \\l, 

D 2 a{A){A, A) = 2\\A- 1 AA- 1 \\ 2 F + 4(A _1 , A~ l AA~~ l AA^ 1 ) F . 

According to Proposition HI the self-convexity of a(A) in GL ra is equivalent 
to 



r \2\\A~'AA-'\\ F + A(A-\A- l AA- v AA- v ) F J + 

A\\A\\ 2 F \\A- T A- l A- T \\ 2 F - 8(A-\ A- l AA- l ) 2 F > 
This inequality is not satisfied when 



A 














l 


: 


: 


1 





and A = | 


1 







\o 





2) 




V o 





0/ 



□ 
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5 The homogeneous linear case 



5.1 The complex projective space. 

The matter of this subsection is mainly taken from Gallot-Hulin-Lafontaine 
© sect. 2.A.5. 

Let V be a Hermitian space of complex dimension dime V — d + 1. We 
denote by F(V) the corresponding projective space that is the quotient of V\ 
{0} by the group C* of dilations of V; P( V) is equipped with its usual smooth 
manifold structure with complex dimension dimP(K) = d. We denote by p 
the canonical surjection. 

Let V be considered as a real vector space of dimension dim R V = 2d + 2 
equipped with the scalar product Re (., .) v . The sphere S(V) is a submanifold 
in V of real dimension 2d + 1. This sphere being equipped with the induced 
metric becomes a Riemannian manifold and, as usual, we identify the tangent 
space at z G S(V) with 

T z S(V) = {ueV : Re(u,z) v = 0} . 

The projective space P(V) can also be seen as the quotient §(V)/S' 1 of 
the unit sphere in V by the unit circle in C for the action given by (A, z) G 
S 1 x S(V) — » Xz G S(V). The canonical map is denoted by 

p v : S(V) -> p(y). 

is the restriction of p to S(V). 

The horizontal space at z G S(V) related to py is defined as the (real) 
orthogonal complement of kei Dpv(z) in T Z S(V). This horizontal space is 
denoted by H z . Since V is decomposed in the (real) orthogonal sum 

V = Rz © Riz © z" 1 

and since ker Dpy(z) = WLiz (the tangent space at z to the circle S l z) we get 

H z = z ± = {u G V : (u, z) = 0} . 

There exists on P(V) a unique Riemannian metric such that pv is a 
Riemannian submersion that is, py is a smooth submersion and, for any 
z G S(V), Dpy(z) is an isometry between if 2 and TJ,( Z )P(V). Thus, for this 
Riemannian structure, one has: 

(Dp v (z)u, Dpv(z)v) Tp{z)¥{v) = Re(u,v) v 
for any z G S(V) and u,d£ i7 z . 



15 



Proposition 6. Let z E S(V) be given. 

1. A chart at p(z) G P(V) is defined by 

ip z : H Z ^¥(V), <p z (u)=p(z + u). 

2. Its derivative at is the restriction of Dp(z) at H z : 

D<p z (0) = Dp(z) : H z -> T P(Z) ¥(V) 
which is an isometry. 

3. For any smooth mapping ip : P(V) — > R 7 and /or any v £ H z we have 

Dip{p{z)) {Dp(z)v) = D(ip o (p z )(0)v 

and 

D 2 iP(p(z))(Dp(z)v, Dp(z)v) = D\iP o <p z )(p)(v,v). 

Proof. 1 and 2 are easy. We have D(ip o v? z )(0) = Dip(p(z))D((p z )(0) which 
gives 3 since -D(y? z )(0)t> = Dp{z)v for any i> e if z . For the second derivative, 
recall that D 2 ifj(p(z))(Dp(z)v, Dp(z)v) = (ip o 7)"(0), where 7 is a geodesic 
curve in P(V) such that 7(0) = p(z),"/'(0) = Dp(z)v. Now, consider the 
horizontal p v — lift 7 of 7 to S(V) with base point z. Note that 7(0) = 
z, 7'(0) = v. Hence, 

O°7)"(0) = (^opo 7 )"(0) = D 2 (^oj9)( 2 )(t;,t;) + ^(p(z))Dp(z)7"(0). 

As 7"(0) is orthogonal to T Z S(V), we have Dp{z)Y{0) = 0. Finally, 

D 2 (^op)(z)(v,v) = &op(z+tv))"(0) = (^o^(to))"(0) = £ 2 (V>o^)(0)M, 

and the assertion on the second derivative follows. □ 

The following result will be helpful. 

Proposition 7. Let M\,M2 be Riemannian manifolds and a 2 : M2 —> 
]0, 00 [ be of class C 2 . Let n : M\ — > M2 be a Riemannian submersion. 
Let U2 Q M.2 be an open set and assume that a± = 0:2 o n is self-convex in 
U\ = it~ 1 (U2)- Then, a 2 is self-convex in 1^2- 
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Proof. Let -M K; i be .Mi, but endowed with the condition metric given by a\, 
and let -M Kj2 be A4 2 , but endowed with the condition metric given by a 2 - 
Then, it : -M Kj i — ► M. K ,2 is also a Riemannian submersion. 

Now, let 72 : [a, b] — ► W2 C .M Ki 2 be a geodesic, and let 71 C .M^i be its 
horizontal lift by 7T. Then, 71 is a geodesic in U\ C .Mi (see [SI Cor 2.109]) 
and hence logai(7i(i)) is a convex function of t. Now, 

log(a 2 (72(*))) = log(a 2 o 7r(7i(t))) = log(a a (7(t))), 

is convex as wanted. □ 

Corollary 7. The function a 2 : P(GL> J -> R, a 2 (A) = ||A|||(7- 2 (A) 
self-convex in P(GL> m ). 

Proof. Note that p : S(GL> m ) — > P(GL> m ) is a Riemannian submersion 
and ct2 = a o p where a is as in Corollary [SI The corollary follows from 
Proposition [7J □ 

5.2 The solution variety. 

Let us denote by p\ and p 2 the canonical maps 

Si n P (K nx(n+1) ) and § 2 ^ P (K n+1 ) = P„(K), 

where Si is the unit sphere in K nx ( n+1 ) and S2 is the unit sphere in K n+1 . 
Consider the affine solution variety, 

yV> = {(M, C) G Si x S 2 : Me GL> n+1 and M( = 0} . 

It is a Riemannian manifold equipped with the metric induced by the product 
metric on K nx ( n+1 ) x K ra+1 . The tangent space to VV > is given by 

T ( m,c)W > = I (M, C) G T M Si x T ? S 2 : M( + M( = o} . 

The projective solution variety considered here is 

W> = {(pi(M),pa(C)) e P (K nx ( n+1 )) x P n (K) : M e GL> n+1 and M( = 0} , 

that is also a Riemannian manifold equipped with the metric induced by the 
product metric on P (K nx ( n+1 )) x P n (K). 

Let us denote by ~K\ the restriction to >V > of the first projection Si x S 2 — > 
Si, and by R : VV > —>R,R = a n o z v We have 
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Lemma 2. Let w = (M, () G W > and let 7 be a geodesic in W > ; 7(0) = w. 
Then, 

Da- n (7r 1 H)(7no 7 )"(0)<0. 

Proof. Our problem is invariant by unitary change of coordinates. Hence, 
using a singular value decomposition, we can assume that M = (E, 0) G 
GL> n+1 , where E = diag (01 > • • • > cr n _i > <7 n ) G K nxn and £ = e n+ i = 
(0, . . . , 0, 1) T G § 2 - As 7 = (M(t), C(t)) is a geodesic of W> C K nx ( n+1 ) x K n , 
7"(0) is orthogonal to T W W, which contains all the pairs of the form ((A, 0), 0) 
where A is a n x n matrix, Re(S, A) = 0. Hence, M"(0) has the form 

M"(0) = (a£,*), 

for some real number a G R. Finally, M(t) is contained in the sphere so 
||M(t)|| F = land 

= (||M(t)||!0"(O) = 2||M / (0)||| + 2Re(M(0),M"(0)) = 2||M'(0)||| + 2a, 
so that a = -||M'(0)||| and (M"(0)) nn = -||M' (0)|||.<T n . From Proposition 

El 

D<7 n (7riH)(7n o 7 )"(0) = Re((vr 1 o 7 )"(0) nn ) = Re(M"(0)) nn < 0. 

□ 

Theorem 3. The map a : VV > —> R given by oi(M,Q = cr n (M) _2 is self- 
convex. 

Proof. Using unitary invariance we can take M = (S,0) G GL> +1 , where 
E = diag (<ti > • ■ • > o-„_i > t7„) G K nxn and C = e n+ i = (0, . . . , 6, 1) T G § 2 - 
According to proposition [3] we have to prove that 

2 \\DR(w)\\ 2 > D 2 R 2 (w)(w,w) 

for every w G VV > and w G T U ,VV > . From Proposition O we have 

DR{w)w = Da n (TTi(w))(Diri(w)w) = Re(D7ii(w)w) nn , 

so that ||£)i2(w)|| = 1. On the other hand, assume that w 7^ and let 7 be 
a geodesic in W > , 7(0) = 10,7(0) = w. From Lemma [21 

D 2 R 2 (w)(w, w) = {al o TTi o 7 )"(0) = 
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D 2 al{'K 1 {w)){D^{w)w, Dn x {w)w) + 2a n Da n {<K 1 {w)){<K l o 7 )"(0) < 
D 2 al(n 1 (w))(Dn 1 (w)(w),D7r 1 (w)(w)). 
Thus, we have to prove that for y e K nx ( n+1 ), 

2\\y\\ 2 >D 2 a 2 n (7r 1 (w))(y,y). 

which is a consequence of our Proposition [5j □ 

Corollary 8. The map a 2 : W> -> R given fry a a (M, C) = ||M|||/a^(M) 
self-convex. 

Proof. Consider the Riemannian submersion 

Pl x p 2 : Si x § 2 — > P (K" x (" +1 )) x P n (K) , pi x p 2 (M, C) = (pi(M),p 2 (C)). 

Note that T(M,f)W > contains the kernel of the derivative D(p± x p 2 )(M, £). 
Thus, the restriction p\ x p 2 : VV > — > W > , is also a Riemannian submersion. 
The corollary follows combining Proposition [7] and Theorem [2J □ 

6 Self-convexity of the distance from a sub- 
manifold of W 

Let M be a C fc submanifold without boundary A/" C W , k > 2. Let us denote 
by 

p(x) = d(x,Af) = inf \\x — y\\ 

the distance from J\f to x e M- 7 (here <i(x, y) = ||x — y|| denotes the Euclidean 
distance). Let W be the largest open set in W such that, for any x G U, there 
is a unique closest point from Af to x. This point is denoted by K{x) so that 
we have a map defined by 

K:U^M, p(x) = d(x,K(x)). 

Classical properties of p and K are given in the following (see also Foote [5] , 
Li and Nirenberg [7j). 

Proposition 8. 1. p is defined and 1—Lipschitz on W , 
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2. For any x G U , x — K{x) is a vector normal to Af at K{x) i.e. x — 

K(x) e {T K{x) Aff, 

3. K is C k ' x on U, 

4- p 2 is C k onU, Dp 2 (x)x = 2 (x — K(x),x) and D 2 p 2 (x)(x, x) = 2||i|| 2 — 
2(DK(x)x,x) 

5. p is C k onU\ M, 

6. (DK(x)x, x) > for every i£W and x G MP. 

Proof. 1. For any x and y one has p(x) = d(x,K(x)) < d(x,K(y)) < 
d(x, y) + d(y, K{y)) = d(x, y) + p(y). Since x and y play a symmetric 
role we get \p(x) — p(y)\ < d(x,y). 

2. This is the classical first order optimality condition in optimization. 

3. This classical result may be derived from the inverse function theorem 
applied to the canonical map defined on the normal bundle to N 

can : NjV — > W , can(y, n) = y + n, 

for every y G Af and n G N y Af = (T y Af) ± . The normal bundle is a C k ~ 1 
manifold, the canonical map is a C k ^ 1 diffeomorphism when restricted 
to the set {(y, n) : y + tn G U, V < t < 1} and K(x) is easily given 
from can -1 . 

4. The derivative of p 2 is equal to Dp 2 (x)x = 2 (x — K(x),x — DK(x)x) = 
2 (x — K(x),x) because DK(x)x G T K ^Af and x— K(x) G (T KtyX )Af) ± . 
Thus Wp 2 (x) = 2{x-K{x)) is C k ~ x on W so that p 2 is The formula 
for D 2 p 2 follows. 

5. Obvious. 

6. Let x{t) be a curve in U with x(0) = x. Let us denote = x(t), 

^1 = x(t), = K(x(t)), ^ = y(t) and ^ = y(t). From the 
first order optimality condition we get 

(x(t)-y(t),y(t)} = 
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whose derivative at t = is 

(x-y,y) + (x-y,y) = 0. 

Thus 



(DK(x)x, x) = (y, x) = {y, y) - (x - y, y) . 

This last quantity is equal to |^ ||x — y u ■ 
the second order optimality condition. 



. It is nonnegative by 

t=o 

□ 



Proof of Theorem [2] and Corollary [TJ, We are now able to prove our 
second main theorem. Let us denote a(x) = l/p(x) 2 . We shall prove that a 
is self-convex on U. From proposition [3] it suffices to prove that, for every 
x G W, 

2||x|| 2 ||Dp(x)|| 2 > D 2 p 2 (x){x,x) 
or, according to Proposition 04 and \\Dp\\ = 1, that 



2\\x\\ z > 2\\x\\* - 2{DK(x)x,x) . 

This is obvious from Proposition [8)4. 

Now we prove Corollary [TJ Let S^IR- 7 ) be the sphere of radius 1 in W> 
and let p^j denote the canonical projection p^j : W — > P(R- ? ). Note that the 
preimage of Af by p^ satisfies 

d{y,p-J{AT)) = d w (p&(y),jV)\\y\\. 

As in the proof of Corollary [HI the mapping 1/ p(x) 2 is self-convex in the set 
§i(M J ) r\pZj(U). Now, apply Proposition [7] to the Riemannian submersion 
to conclude the corollary. □ 



Two examples. 

Example 2. Take U the unit disk in M. 2 and M the unit circle. The corre- 
sponding function is given by 

a(x) = d(x,Af)- 2 = 1/(1- ||a;||) 2 . 

According to Theorem^ the map loga(x) is convex along the condition 
geodesies in 

U\{(0,0)} = {x EM 2 : 0<||x||<l}. 
21 



This property also holds inU: a geodesic through the origin is a ray x(t) = 
(— 1 + e*)(cos 9, sin 9) when — oo < t < 0, and x(t) = (1 — e~*)(cos#, sin 9) 
when < t < oo for some 9. In that case 

log a(x(i)) = 2 \t\ 

which is convex. 

Example 3. Take M C M 2 equal to the union of the two points (— 1, 0) and 
(1, 0). In that case 

a^xY 1 = d(x,Af) 2 = min ((1 + x^ 2 + xj, (1 - Xi) 2 + x 2 2 ) . 

It may be shown that for any < a < 1/10, the straight line segment is 
the only minimizing geodesic joining the points (0, — a) and (0, a). Since 
loga(0,t) = — log(l + t 2 ) has a maximum at t = 0, g(t), —a <t<a, cannot 
be log-convex. Here {0} x R is equal to the locus in M 2 of points equally 
distant from the two nodes which is the set we avoid in Theorem^ 
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